Server, server control method, program and recording medium

ABSTRACT

A request receiver (101) receives, from a terminal, a request in which image ID information and a parameter for image processing are specified. An image processor (102) acquires an image based on the ID information specified in the received request and, by applying the image processing to the acquired image using the parameter specified in the received request, outputs extracted information that was extracted from inside the image. A response transmitter (103) transmits to the terminal a response in which the extracted information outputted by the image processor (102) is specified. When extracted information that is associated with the ID information and the parameter specified in the received request is cached, the image processor (102), instead of acquiring the image and applying the image processing, obtains the cached extracted information and outputs it.

TECHNICAL FIELD

The present invention relates to a server, a server control method, a program and a recording medium, and particularly, relates to the translation of character strings inside an image.

BACKGROUND ART

Various translation tools are conventionally known; however, there is a need to use those tools to translate WEB pages that have images in which character strings are included. In order to translate character strings that are included in an image, it is first necessary to perform character recognition on the characters in the image. Then, after the characters have been identified by character recognition, those characters are translated into the target language. However, character recognition processing requires a certain amount of time, so there was a problem in that translation in real time was difficult. Various methods have been proposed to deal with this kind of problem. For example, Patent Literature 1 discloses a machine translation system that associates and stores images that include characters with character strings for translation, and when translating an image that includes characters, provides a translation result of the character string.

CITATION LIST

Patent Literature

-   Patent Literature 1: Japanese Patent No. 4658236

SUMMARY OF INVENTION

Technical Problem

There is also a desire to translate characters in an image as is without preparing character strings for translation.

The objective of the present invention is to solve a problem such as described above by providing a server, a server control method, a program and a recording medium that are suitable for translating characters of images in a WEB page in real-time.

Solution to Problem

A server according to a first aspect of the present invention, has

a request receiver that receives a request in which image ID information and a parameter for image processing are specified, from a terminal;

an image processor that acquires an image based on the ID information that was specified in the received request, and by using the parameter that was specified in the received request to apply the image processing on the acquired image, outputs extracted information that was extracted from inside the image; and

a response transmitter that transmits a response in which the extracted information that was outputted by the image processor is specified, to the terminal; wherein

after the extracted information has been extracted, the image processor associates and caches the extracted information with the image ID information and the parameter; and

when the extracted information that is associated with the ID information and the parameter that were specified in the received request is cached, the image processor, instead of acquiring the image and applying the image processing, obtains the cached extracted information, and outputs the obtained extracted information.

In the server according to the aspect above,

the parameter that is specified in the request includes a specified area;

the image processing by the image processor includes extraction processing that, by performing character recognition on the specified area inside the image that is included in the parameter, outputs a recognized character string that was recognized from the specified area in the image;

after the recognized character string has been extracted, the image processor associates and caches the recognized character string with the image ID information and the specified area;

when a recognized character string that is associated with the ID information and the specified area included in the parameter that are specified in the received request is cached, the image processor, instead of the character recognition, obtains the cached recognized character string, and outputs that obtained recognized character string; and

the extracted information includes the recognized character string.

Moreover, in the server according to the aspect above,

when the ID information that was associated when the recognized character string was cached matches the ID information that was specified in the received request, and the specified area that was associated when the recognized character string was cached overlaps the specified area that is included in the parameter that was specified in the received request, the image processor determines that the extracted information that is associated with the ID information and the specified area included in the parameter that were specified in the received request is cached.
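The cache-hit determination described above can be illustrated with a short sketch. The following Python is only a minimal illustration under stated assumptions, not the claimed implementation: the `Area` type, the (left, top, right, bottom) pixel convention, and the names `areas_overlap` and `find_cached_string` are hypothetical, and treating any intersection of positive size as an overlap is an assumption, since the text does not state how much overlap is required.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Area:
    """Hypothetical rectangle in image pixel coordinates."""
    left: int
    top: int
    right: int
    bottom: int


def areas_overlap(a: Area, b: Area) -> bool:
    # Assumption: two areas overlap when their intersection has positive size.
    return (a.left < b.right and b.left < a.right
            and a.top < b.bottom and b.top < a.bottom)


def find_cached_string(cache: Dict[str, List[Tuple[Area, str]]],
                       image_id: str, requested: Area) -> Optional[str]:
    # An entry counts as cached only when the ID information matches and
    # the cached specified area overlaps the requested specified area.
    for cached_area, recognized in cache.get(image_id, []):
        if areas_overlap(cached_area, requested):
            return recognized
    return None
```

With this convention, areas that merely share an edge are not treated as overlapping; a stricter or looser criterion could be substituted in `areas_overlap` without changing the lookup.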

Furthermore, in the server according to the aspect above,

when the recognized character string is extracted, the image processor caches the recognized character string after correcting the specified area to a recognized area in the specified area where the recognized character string was recognized.

In the server according to the aspect above,

the parameter that is specified in the request includes a language for recognition; and

the character recognition by the image processor is performed using the language for recognition.

Moreover, in the server according to the aspect above,

the parameter that is specified in the request includes conversion information;

the image processing by the image processor includes conversion processing that, by performing a conversion processing on the outputted recognized character string using the conversion information that is included in the parameter that was specified in the received request, outputs a converted character string that is converted from the recognized character string; and

the extracted information includes the converted character string.

Furthermore, in the server according to the aspect above,

when the converted character string is extracted, the image processor associates and caches the converted character string with the image ID information, the specified area, the language for recognition, the recognized character string, and the conversion information; and

when the converted character string is cached for the ID information, the specified area, the language for recognition and the conversion information that were specified in the received request, the image processor, instead of the conversion processing, obtains the cached converted character string, and outputs the obtained converted character string.
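As a rough illustration of the caching of converted character strings described above, the sketch below keys a cache on the four items named in the text (ID information, specified area, language for recognition, conversion information) and stores the recognized character string alongside the converted one. The class name `ConversionCache`, the tuple layout of the key, and the injected `convert` callable are all hypothetical stand-ins; the actual conversion processing is not reproduced here.

```python
from typing import Callable, Dict, Tuple

# Hypothetical key: (image ID, specified area, language for recognition,
# conversion information). The area is a (left, top, right, bottom) tuple.
ConversionKey = Tuple[str, Tuple[int, int, int, int], str, str]


class ConversionCache:
    def __init__(self, convert: Callable[[str, str], str]) -> None:
        # Stand-in for the conversion processing (for example, translation).
        self._convert = convert
        # Each entry keeps the recognized string next to the converted one.
        self._entries: Dict[ConversionKey, Tuple[str, str]] = {}

    def converted_string(self, key: ConversionKey, recognized: str) -> str:
        if key in self._entries:
            # Cached: skip the conversion processing entirely.
            return self._entries[key][1]
        converted = self._convert(recognized, key[3])
        self._entries[key] = (recognized, converted)
        return converted
```

A second call with the same key returns the stored string without invoking `convert` again, which is the saving the aspect above is aiming for.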

In the server according to the aspect above,

the conversion processing obtains the converted character string by translating the recognized character string to a translation target language that is specified in the conversion information.

Moreover, the server according to the aspect above, further has

an inquiry receiver that receives an inquiry in which image ID information is specified, from the terminal; and

an answer transmitter that,

-   when the extracted information that is associated with the ID information that is specified in the received inquiry is not cached, transmits an answer in which a message is specified that prompts transmission of the request in which the image ID information and the parameter for image processing are specified, to the terminal; and
-   when the extracted information that is associated with the ID information that is specified in the received inquiry is cached, transmits an answer in which the extracted information is specified, to the terminal.
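The two branches of the answer transmitter can be sketched as follows. This is a hedged illustration only: the cache layout, the function name `answer_inquiry`, the dictionary shape of the answer, and the message wording are hypothetical, and returning every cached entry for the image is an assumption, since the text does not say which entry is chosen when several parameters are cached for the same image.

```python
from typing import Dict, Tuple

# Hypothetical cache layout: (image ID, parameter) -> extracted information.
Cache = Dict[Tuple[str, str], str]


def answer_inquiry(cache: Cache, image_id: str) -> Dict[str, object]:
    cached = [info for (cached_id, _param), info in cache.items()
              if cached_id == image_id]
    if not cached:
        # Nothing cached for this image: prompt the terminal to send a
        # request that specifies the image ID and a parameter.
        return {"cached": False,
                "message": "Send a request with the image ID and a parameter."}
    # Assumption: every cached entry for the image is returned.
    return {"cached": True, "extracted": cached}
```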

Furthermore, the server according to the aspect above, further has:

an inquiry receiver that receives an inquiry in which the image ID information and a position inside that image are specified, from the terminal; and

an answer transmitter that, when a recognized character string that is associated with the ID information that is specified in the received inquiry and a specified area that includes the position that is specified in the received inquiry is cached by the image processor, transmits an answer in which the cached recognized character string is specified, to the terminal.

The server according to the aspect above, further has:

an inquiry receiver that receives an inquiry in which image ID information, a position inside the image and the conversion information are specified, from the terminal; and

an answer transmitter that, when the converted character string that is associated with the ID information that is specified in the received inquiry, the conversion information, and the specified area that includes the position specified in the received inquiry is cached by the image processor, transmits an answer in which the cached converted character string is specified, to the terminal.

A program according to a second aspect of the present invention is a program that is executed by a computer that has a display that displays an image identified by the ID information on a screen and that is capable of communicating with the server according to the aspect above, and causes the computer to function as:

a position detector that detects a selection of a position inside the image that is displayed on the screen;

an inquiry transmitter that, when the selection of the position is detected, transmits an inquiry in which the ID information and the selected position are specified, to the server; and

an answer receiver that receives an answer from the server;

the program causes the computer to function such that

the display further displays on the screen the extracted information that is specified in the received answer;

the program causes the computer to function as:

an area detector that detects a selection of an area inside the image that is displayed on the screen;

a setter that prompts setting the parameter for image processing;

a request transmitter that, when the selection of the area is detected, transmits a request in which the ID information, the set parameter, and the selected area are specified, to the server; and

a response receiver that receives a response from the server; and

the program causes the computer to function such that

the display further displays on the screen the extracted information that is specified in the received response.

A control method for a server according to a third aspect of the present invention, has:

a request receiving step for receiving a request in which image ID information and a parameter for image processing are specified, from a terminal;

an image processing step for acquiring an image based on the ID information that was specified in the received request, and by using the parameter that was specified in the received request to apply the image processing on the acquired image, outputting extracted information that was extracted from inside the image;

a response transmission step for transmitting a response in which the extracted information that was outputted is specified, to the terminal; wherein

the image processing step includes,

after the extracted information has been extracted, associating and caching the extracted information with the image ID information and the parameter; and

when the extracted information that is associated with the ID information and the parameter that were specified in the received request is cached, instead of acquiring the image and applying the image processing, obtaining the cached extracted information, and outputting the obtained extracted information.

A program according to a fourth aspect of the present invention causes a computer to function as:

a request receiver that receives a request in which image ID information and a parameter for image processing are specified, from a terminal;

an image processor that acquires an image based on the ID information that was specified in the received request, and by using the parameter that was specified in the received request to apply the image processing on the acquired image, outputs extracted information that was extracted from inside the image; and

a response transmitter that transmits a response in which the extracted information that was outputted by the image processor is specified, to the terminal; wherein

after the extracted information has been extracted, the image processor associates and caches the extracted information with the image ID information and the parameter; and

when extracted information that is associated with the ID information and the parameter that were specified in the received request is cached, the image processor, instead of acquiring the image and applying the image processing, obtains the cached extracted information, and outputs the obtained extracted information.

A recording medium according to a fifth aspect of the present invention is readable by a computer and records a program that causes the computer to function as:

a request receiver that receives a request in which image ID information and a parameter for image processing are specified, from a terminal;

an image processor that acquires an image based on the ID information that was specified in the received request, and by using the parameter that was specified in the received request to apply the image processing on the acquired image, outputs extracted information that was extracted from inside the image; and

a response transmitter that transmits a response in which the extracted information that was outputted by the image processor is specified, to the terminal; wherein

after the extracted information has been extracted, the image processor associates and caches the extracted information with the image ID information and the parameter; and

when the extracted information that is correlated with the ID information and the parameter that were specified in the received request is cached, the image processor, instead of acquiring the image and applying the image processing, obtains the cached extracted information, and outputs the obtained extracted information.

The program above can be distributed and sold by way of a computer communication network independent from the computer that executes the program. Moreover, the recording medium above can be distributed and sold independent from the computer.

Here, a non-transitory recording medium is a tangible recording medium. A non-transitory recording medium is, for example, a compact disk, a flexible disk, a hard disk, a magneto-optical disk, a digital video disk, a magnetic tape, a semiconductor memory and the like. Moreover, a transitory recording medium is the transmission medium (propagation signal) itself. A transitory recording medium, for example, is an electric signal, an optical signal, an electromagnetic wave and the like. A temporary memory area is an area for temporarily storing data and programs, and is, for example, a volatile memory such as RAM (Random Access Memory).

Advantageous Effects of Invention

With the present invention it is possible to provide a server, a server control method, a program and a recording medium that are suitable for translating characters in an image on a WEB page in real-time.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a drawing illustrating the relationship among a server, a WEB server, and a terminal of embodiments of the present invention;

FIG. 2 is a drawing illustrating the basic construction of a typical information processing device by which the server or the terminal of the embodiments of the present invention is achieved;

FIG. 3 is a drawing for explaining an example of a WEB page that is displayed by a terminal;

FIG. 4 is a drawing for explaining the basic construction of a server of a first embodiment;

FIGS. 5A to 5C are drawings for explaining examples of an image on the WEB page;

FIG. 6 is a drawing for explaining an extracted information table;

FIG. 7 is a drawing for explaining an example of the image on the WEB page;

FIG. 8 is a flowchart for explaining the control process performed by each part of the server of the first embodiment;

FIGS. 9A to 9D are drawings for explaining specified areas;

FIG. 10 is a drawing for explaining an extracted information table;

FIG. 11 is a drawing for explaining a recognized area;

FIG. 12 is a drawing for explaining an extracted information table;

FIGS. 13A to 13D are drawings for explaining overlapping of the specified area;

FIG. 14 is a flowchart for explaining the control process that is performed by each part of a server of a second embodiment;

FIGS. 15A to 15C are drawings for explaining examples of the image on the WEB page;

FIG. 16 is a drawing for explaining an extracted information table;

FIGS. 17A to 17D are drawings for explaining examples of the image on the WEB page;

FIG. 18 is a flowchart for explaining the control process that is performed by each part of a server of a third embodiment;

FIG. 19 is a drawing for explaining the basic construction of a server of a fourth embodiment;

FIGS. 20A and 20B are drawings for explaining examples of the image on the WEB page;

FIG. 21 is a flowchart for explaining the control process that is performed by each part of a server of a fourth embodiment;

FIGS. 22A to 22C are drawings for explaining examples of the image on the WEB page;

FIG. 23 is a flowchart for explaining the control process that is performed by each part of a server of a fifth embodiment;

FIG. 24 is a drawing for explaining an extracted information table;

FIGS. 25A to 25C are drawings for explaining examples of the image on the WEB page;

FIG. 26 is a flowchart for explaining the control process that is performed by each part of a server of a sixth embodiment;

FIG. 27 is a drawing for explaining the basic construction of a terminal on which a program of a seventh embodiment operates; and

FIG. 28 is a flowchart for explaining the control process that is performed by each part of the terminal of the seventh embodiment.

DESCRIPTION OF EMBODIMENTS

As illustrated in FIG. 1, a server 100 of embodiments of the present invention is connected to the Internet 300. Plural terminals 211, 212 to 21n (hereafter, these will collectively be called “terminal 200”) that are operated by users and a WEB server 400 are connected to the Internet 300. The WEB server 400 provides a specified WEB page to the terminal 200 according to a request from the terminal 200. The server 100 then performs translation of the WEB page according to a request from the terminal 200, and provides the translation result to the terminal 200. In the embodiments that will be described later, the server 100 and the WEB server 400 are connected by way of the Internet 300 as illustrated in FIG. 1; however, this is only an example. For example, it is possible for the server 100 and the WEB server 400 to be achieved by the same device. Alternatively, the server 100 and the WEB server 400 can also be connected directly.

In the following, a typical information processing device 500 by which the server 100 or the terminal 200 of the embodiments of the present invention is achieved will be explained.

(1. Basic Construction of an Information Processing Device)

As illustrated in FIG. 2, the information processing device 500 has a CPU (Central Processing Unit) 501, a ROM (Read Only Memory) 502, a RAM 503, an NIC (Network Interface Card) 504, an image processor 505, an audio processor 506, a DVD-ROM (Digital Versatile Disc ROM) drive 507, an interface 508, an external memory 509, a controller 510, a monitor 511 and a speaker 512.

The CPU 501 controls the overall operation of the information processing device 500, and exchanges control signals and data with each of the connected component elements.

The ROM 502 stores an IPL (Initial Program Loader) that is executed immediately after the power has been turned ON, and by executing this IPL, a specified program is read to the RAM 503, and the CPU 501 starts execution of that program. The ROM 502 also stores an operating system program and various data that are necessary for controlling the overall operation of the information processing device 500.

The RAM 503 is for temporarily storing data and programs, and stores programs and data that are read from a DVD-ROM, as well as data that is necessary for communication.

The NIC 504 is for connecting the information processing device 500 to a computer network such as the Internet 300. It has an interface (not illustrated in the figure) that acts as a go-between for the CPU 501 and devices such as a device according to the 10BASE-T/100BASE-T standard that is used when creating a LAN (Local Area Network), an analog modem, an ISDN (Integrated Services Digital Network) modem, an ADSL (Asymmetric Digital Subscriber Line) modem for connecting to the Internet using a telephone line, a cable modem for connecting to the Internet using a cable television line, and the like.

After processing data that is read from a DVD-ROM or the like by the CPU 501 or an image computation processor (not illustrated in the figure) of the image processor 505, the image processor 505 stores the result in a frame memory (not illustrated in the figure) of the image processor 505. The image information that is stored in the frame memory is converted to a video signal at specified synchronization timing, and outputted to the monitor 511. As a result, various pages can be displayed.

The audio processor 506 converts audio data that is read from a DVD-ROM or the like to an analog audio signal, and causes that audio signal to be outputted from the connected speaker 512. Moreover, under the control of the CPU 501, the audio processor 506 creates sounds that are supposed to be generated during processing by the information processing device 500, and causes corresponding audio to be outputted from the speaker 512.

A program for achieving, for example, the server 100 of the embodiments, is recorded on a DVD-ROM that is mounted in the DVD-ROM drive 507. Under control of the CPU 501, the DVD-ROM drive 507 performs processing to read the mounted DVD-ROM, reads the necessary program and data, and temporarily stores those in the RAM 503 or the like.

The external memory 509, the controller 510, the monitor 511 and the speaker 512 are connected to the interface 508 so as to be removable.

The external memory 509 stores data related to personal information of the user and the like so as to be rewritable.

The controller 510 receives operation input, such as input performed when making various settings for the information processing device 500. By inputting instructions by way of the controller 510, the user of the information processing device 500 is able to store the data in a suitable external memory 509.

The monitor 511 provides the user of the information processing device 500 with data that is outputted from the image processor 505.

The speaker 512 provides the user of the information processing device 500 with audio data that is outputted from the audio processor 506.

In addition, the information processing device 500 may be constructed so as to be able to use a large-capacity external storage device such as a hard disk and the like in order to achieve the same functions as the ROM 502, the RAM 503, the external memory 509, a DVD-ROM that is mounted in the DVD-ROM drive 507 and the like.

In the following, the server 100 or the terminal 200 of the embodiments that are achieved by the information processing device 500 above will be explained with reference to FIG. 1 to FIG. 28. By turning ON the power to the information processing device 500, the program that causes the information processing device 500 to function as the server 100 or terminal 200 of the embodiments is executed, and the server 100 or terminal 200 is achieved.

FIG. 3 illustrates an example of a WEB page that is provided to the terminal 200 by the WEB server 400 in the embodiments below. The WEB page 600 includes images 601, 602 and 603 and texts 604, 605 and 606. The images 601 and 603 are images that include text, and the image 602 is an image that does not include text. Moreover, the image URLs of the images 601, 602 and 603 are taken to be “http://xxx.601.jpg”, “http://xxx.602.jpg” and “http://xxx.603.jpg”.

(2. Basic Construction of a Server of a First Embodiment)

When translating a character string that is included in an image for which character recognition has already been performed, a server 100 of a first embodiment is able to omit the processing for acquiring the image and performing the character recognition.

As illustrated in FIG. 4, the server 100 of this embodiment has a request receiver 101, an image processor 102, and a response transmitter 103.

The request receiver 101 receives a request, in which image ID information and a parameter for image processing are specified, from the terminal 200.

The image ID information, for example, is an image URL. Moreover, image processing, for example, is a character recognition processing for recognizing characters inside an image. A parameter for image processing, for example, is a language for recognition that indicates which language the characters inside an image are in. Character recognition is performed on the presumption that the characters inside the image are in the specified language for recognition.

In the following, an example is explained for the case in which a user requests translation of the image 601 of the WEB page 600 from the server 100. For example, when a user places a cursor 701 over the image 601 that is displayed on the terminal 200 (mouse over), a pop up 801, requesting that the language for recognition be specified, is displayed as illustrated in FIG. 5A. As illustrated in FIG. 5B, when a user clicks on a button 802 with the cursor 701, a pull-down menu is displayed. The user then selects the language (language for recognition) of the character string inside the image 601 from the menu. As illustrated in FIG. 5C, when “English” is specified as the language for recognition, a request, in which the image URL “http://xxx.601.jpg” of the image 601 and the language for recognition “English” are specified, is transmitted to the server 100. The request receiver 101 of the server 100 receives that request.

In this embodiment, the CPU 501 and the NIC 504 work together to function as the request receiver 101. The same is true for the following second through sixth embodiments.

The image processor 102 acquires an image based on the ID information that is specified in the received request, and by applying image processing using the parameter that is specified in the received request, outputs extracted information that was extracted from inside the image.

Here, the extracted information, for example, is the character recognition result, which is a character string inside the obtained image (in the following, this will be called the “recognized character string”). For example, when a request that specifies the image URL “http://xxx.601.jpg” and the language for recognition “English” is received, the image processor 102 first acquires the image 601 from the WEB server 400 based on that image URL. The image processor 102 performs character recognition presuming the language of the characters included in the acquired image 601 to be English. As a result of character recognition, it is determined that the characters inside the image 601 are “Sale Water Best bang for the buck”, and then the image processor 102 outputs these character strings as recognized character strings that were extracted from inside the image 601.

Moreover, after the extracted information has been extracted, the image processor 102 associates and caches that extracted information with the image ID information and the parameter.

For example, an extracted information table 101a such as illustrated in FIG. 6 is stored in the RAM 503. In the extracted information table 101a, image ID information (image URLs) 101a1, parameters (languages for recognition) 101a2, and extracted information (recognized character strings) 101a3 are associated and registered. The image processor 102, by using character recognition, obtains the recognized character string “Sale Water Best bang for the buck”, and then associates and registers that recognized character string and the image URL of the image 601 “http://xxx.601.jpg” with the language for recognition “English” in the extracted information table 101a.
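One way to picture the extracted information table is as a mapping from the pair (image URL, language for recognition) to the recognized character string. The Python below is a minimal sketch under that assumption; the names `TableKey`, `register` and `lookup` are hypothetical and are not taken from the embodiment, which only states that the three items are associated and registered.

```python
from typing import Dict, NamedTuple, Optional


class TableKey(NamedTuple):
    image_url: str             # image ID information
    recognition_language: str  # parameter for image processing


# Maps each key to the extracted information (recognized character string).
ExtractedInformationTable = Dict[TableKey, str]


def register(table: ExtractedInformationTable, image_url: str,
             language: str, recognized: str) -> None:
    # Associate the recognized string with the image URL and the language.
    table[TableKey(image_url, language)] = recognized


def lookup(table: ExtractedInformationTable, image_url: str,
           language: str) -> Optional[str]:
    # Returns None when no entry is registered for this URL and language.
    return table.get(TableKey(image_url, language))
```

Keying on both items means that the same image requested with a different language for recognition is treated as not yet registered.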

In this embodiment, the CPU 501 and the image processor 505 work together to function as the image processor 102. The same is true in the second through sixth embodiments below as well.

The response transmitter 103 transmits a response in which the extracted information that was outputted by the image processor 102 is specified to the terminal 200.

For example, the response transmitter 103 transmits a response in which the recognized character string “Sale Water Best bang for the buck” is specified to the terminal 200. Here, that response, for example, also includes a command to add a tag for overlaying and displaying an arbitrary shape to the image in the WEB page that is displayed on the terminal 200 for which character recognition is completed. For example, as illustrated in FIG. 7, after the terminal 200 receives that response, a semi-transparent rectangle 901 is overlaid and displayed on the image 601 for which character recognition is complete.

In this embodiment, the CPU 501 and the NIC 504 work together to function as the response transmitter 103. The same is also true for the second through sixth embodiments below.

When the extracted information that is associated with the ID information and parameter that are specified in the received request is cached, the image processor 102, instead of acquiring an image and applying image processing, obtains the cached extracted information, and outputs that obtained extracted information.

For example, the request receiver 101 receives a request, in which the image URL of the image 601 and the language for recognition “English” are specified, and the image processor 102 references the extracted information table 101a in FIG. 6. The recognized character string “Sale Water Best bang for the buck” that corresponds to the ID information of the image 601 and the language for recognition “English” is registered, so that the image processor 102 omits the process for obtaining the image 601 and performing character recognition, and outputs that recognized character string to the response transmitter 103.

(3. Operation of the Server of the First Embodiment)

Next, the flowchart in FIG. 8 will be used to explain the operations that are performed by all of the parts of the server 100 of this embodiment. After the power has been turned ON to the server 100, the CPU 501 starts the control process that is illustrated in the flowchart in FIG. 8.

The request receiver 101 receives the request, in which the image ID information (image URL) and the parameter (language for recognition) for image processing are specified, from the terminal 200 (step S101). For example, when the user places the cursor 701 over the image 601 and specifies the language for recognition “English” by following the procedure illustrated in FIGS. 5A to 5C, the request receiver 101 receives the request in which the image URL “http://xxx.601.jpg” and the language for recognition “English” are specified.

The image processor 102 then determines whether or not the extracted information (recognized character string) that is associated with the ID information and the language for recognition that are specified in the received request is cached (step S102).

When it is determined that the recognized character string is cached (step S102; YES), the image processor 102 outputs the recognized character string that is cached (step S103). For example, when the extracted information table 101a illustrated in FIG. 6 is stored in the RAM 503, the image processor 102 outputs the recognized character string “Sale Water Best bang for the buck” that is associated with the image URL “http://xxx.601.jpg” and the language for recognition “English” to the response transmitter 103.

On the other hand, when it is determined that the recognized character string is not cached (step S102; NO), the image processor 102 acquires the image of the image URL that is specified in the received request (step S105). For example, when information about the recognized character string that is associated with the image URL “http://xxx.601.jpg” and the language for recognition “English” is not registered in the extracted information table 101a, the image processor 102 accesses the WEB server 400 and acquires the image 601 that corresponds to the image URL “http://xxx.601.jpg”.

After acquiring the image, the image processor 102 then performs character recognition using the language for recognition that was specified in the request, and extracts a character string (recognized character string) from inside the image (step S106). For example, the image processor 102 performs character recognition of the image 601 presuming that the characters inside the image 601 are “English”, and obtains the recognized character string “Sale Water Best bang for the buck”.

The image processor 102 associates and caches the recognized character string with the image ID information and the language for recognition (step S107). For example, as illustrated in FIG. 6, the image processor 102 associates the recognized character string “Sale Water Best bang for the buck” with the image URL “http://xxx.601.jpg” and the language for recognition “English”, and registers these in the extracted information table 101a.

The image processor 102 then outputs the extracted recognized characterstring (step S108). For example, the image processor 102 outputs therecognized character string “Sale Water Best bang for the buck” to theresponse transmitter 103 as the result of character recognition.

The response transmitter 103 transmits the response, in which the recognized character string that was outputted in step S103 or step S108 is specified, to the terminal 200 (step S104). For example, the response transmitter 103 transmits a response, in which the recognized character string “Sale Water Best bang for the buck” is specified, to the terminal 200.

With this embodiment, when the result of character recognition for animage is already obtained, it is possible to omit the process ofacquiring the image and performing character recognition. Then, bytranslating the character strings for which the result of characterrecognition has been obtained, it is possible to shorten the timerequired for translating the character strings inside the image.
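The flow of FIG. 8 can be sketched as follows. This is only a minimal illustration, assuming that the extracted information table 101 a is held as an in-memory dictionary keyed by the image URL and the language for recognition. The names `handle_request`, `fetch_image` and `recognize` are hypothetical and do not appear in the embodiment; the fetching and character recognition functions are replaced by stand-ins.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical stand-in for the extracted information table 101 a:
# (image URL, language for recognition) -> recognized character string.
ExtractedTable = Dict[Tuple[str, str], str]


def handle_request(
    table: ExtractedTable,
    image_url: str,
    language: str,
    fetch_image: Callable[[str], bytes],
    recognize: Callable[[bytes, str], str],
) -> str:
    """Return the recognized character string, using the cache when possible."""
    key = (image_url, language)
    if key in table:                    # step S102: YES
        return table[key]               # step S103
    image = fetch_image(image_url)      # step S105
    text = recognize(image, language)   # step S106
    table[key] = text                   # step S107
    return text                         # step S108


# Stand-ins that only record how often the costly steps run.
fetch_calls: List[str] = []


def fake_fetch(url: str) -> bytes:
    fetch_calls.append(url)
    return b"image-bytes"


def fake_recognize(image: bytes, language: str) -> str:
    return "Sale Water Best bang for the buck"


table: ExtractedTable = {}
first = handle_request(table, "http://xxx.601.jpg", "English", fake_fetch, fake_recognize)
second = handle_request(table, "http://xxx.601.jpg", "English", fake_fetch, fake_recognize)
```

In this sketch the second call is answered from the table, so the image is fetched only once.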

(4. Basic Construction of a Server of a Second Embodiment)

A server 100 of a second embodiment is able to omit the process ofacquiring an image and performing character recognition when the processof character recognition has already been performed for an areaspecified inside an image when translating character strings that areincluded in the image.

The server 100 of this embodiment, as in the first embodiment, has arequest receiver 101, an image processor 102 and a response transmitter103 (see FIG. 4); however, the functions and information handled by eachpart are different from those of the first embodiment. This will beexplained in detail below.

The request receiver 101 receives a request, in which ID information for an image and a specified area inside that image are specified, from the terminal 200.

Here, the specified area, for example, is an area inside the image of aWEB page that is specified by a user operating the terminal 200. Forexample, the specified area is taken to be an area that is designated bya rectangular shape and specified by the coordinates of the upper leftcorner and lower right corner of the rectangle.

For example, as illustrated in FIG. 9A, the user uses the terminal 200and selects an area 902 inside the image 601 by dragging a cursor 701.Here, the coordinates of the upper left corner 911 of the image 601(taken to be the “(vertical coordinate, horizontal coordinate)”) are setas (0, 0), and the coordinates of the lower right corner 912 are set as(200, 300). Moreover, the coordinates of the upper left corner of area902 are taken to be (5, 100), and the coordinates of the lower rightcorner are (48, 200). After the area 902 has been specified by the user,the terminal 200 transmits a request, in which the image URL“http://xxx.601.jpg” of the image 601, and the specified area “(5,100):(48, 200)” are specified, to the server 100. The request receiver101 of the server 100 receives that request.

The method for specifying the specified area is not limited to thatdescribed above. For example, as illustrated in FIG. 9B, a rectangle 903having a specified size with a cursor 701 in the center can be displayedwhen the cursor is placed inside the image, and the area of thatrectangle 903 is potentially specified as the specified area. Moreover,the specified area is not limited to a rectangle, and could also be acircle and the like.

The image processor 102 acquires an image based on the ID informationthat is specified in the received request, and by performing characterrecognition of the specified area inside the acquired image, outputs arecognized character string that was recognized from that specified areain the image.

For example, when a request, in which the image URL “http://xxx.601.jpg”and the specified area “(5, 100):(48, 200)” are specified, is received,the image processor 102 first acquires the image 601 from the WEB server400 based on the image URL. Next, the image processor 102 determines thelanguage for recognition for performing character recognition. Forexample, when a domain code indicating a country is included in the URLof the WEB page 600, the image processor 102 determines from that domaincode the language of the characters included in the image. For example,when “.UK” is included in the domain code, the image processor 102determines “English” as the language for recognition. Alternatively, theimage processor 102 determines whether there is a tag in the html thatspecifies the language, and when there is such a tag, takes thatlanguage to be the language for recognition. For example, when there isa tag “<html lang=“en”>” inside the html, the image processor 102 takes“English” to be the language for recognition. In this embodiment, it ispresumed that the image processor 102 has determined “English” to be thelanguage for recognition. The image processor 102 performs characterrecognition presuming the characters that are included in the specifiedarea “(5, 100):(48, 200)” in the acquired image 601 to be “English”.When the characters inside that specified area are determined to be“Sale” as a result of character recognition, the image processor 102outputs this character string as the recognized character string.
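The two ways of determining the language for recognition described above can be sketched as follows. This is only an illustration: the mapping tables, the function name `language_for_recognition`, the page URLs in the usage lines, and the order in which the two checks are tried are all assumptions, since the embodiment presents the domain code and the html tag as alternatives without fixing a priority.

```python
import re
from typing import Optional
from urllib.parse import urlparse

# Hypothetical lookup tables; the embodiment only gives ".UK" -> "English"
# and lang="en" -> "English" as examples.
DOMAIN_TO_LANGUAGE = {"uk": "English", "jp": "Japanese", "de": "German"}
LANG_TAG_TO_LANGUAGE = {"en": "English", "ja": "Japanese", "de": "German"}


def language_for_recognition(page_url: str, html: str) -> Optional[str]:
    """Guess the language for recognition from the page URL or the html."""
    # 1) Country domain code at the end of the host name.
    host = urlparse(page_url).hostname or ""
    domain_code = host.rsplit(".", 1)[-1].lower()
    if domain_code in DOMAIN_TO_LANGUAGE:
        return DOMAIN_TO_LANGUAGE[domain_code]
    # 2) lang attribute of the <html> tag, e.g. <html lang="en">.
    match = re.search(r'<html[^>]*\blang\s*=\s*["\']?([A-Za-z]+)', html, re.IGNORECASE)
    if match:
        return LANG_TAG_TO_LANGUAGE.get(match.group(1).lower())
    return None


from_domain = language_for_recognition("http://www.example.co.uk/page", "<html>")
from_tag = language_for_recognition("http://www.example.com/page", '<html lang="en">')
unknown = language_for_recognition("http://www.example.com/page", "<html>")
```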

After the recognized character string has been extracted, the image processor 102 associates the recognized character string with the image ID information and the specified area, and caches that recognized character string.

For example, as illustrated in FIG. 10, the image processor 102associates the recognized character string “Sale” with the image URL“http://xxx.601.jpg” of the image 601 and the language for recognition“English” and the specified area “(5, 100):(48, 200)”, and registersthem in the extracted information table 101 a.

Here, after the recognized character string has been extracted, it ispossible for the image processor 102 to correct the specified area to arecognized area inside the specified area where the recognized characterstring was recognized, and then cache that recognized character string.

The recognized area, for example, is a rectangle that includes arecognized character string, and is an area that is identified by arectangle for which lengths in the vertical direction and horizontaldirection of the rectangle have been minimized. For example, therecognized area for “Sale” (area 921 in FIG. 11) is “(8, 110):(45,170)”, the recognized area for “Water” (area 922 in FIG. 11) is “(60,120):(90, 180)”, and the recognized area for “Best bang for the buck”(area 923 in FIG. 11) is “(110, 25):(190, 270)”.
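One way to obtain such a minimal rectangle is sketched below, assuming that character recognition yields a bounding box for each recognized character. The per-character boxes, the function names and the tuple layout are hypothetical; only the resulting area “(8, 110):(45, 170)” is taken from the example above. Coordinates follow the “(vertical coordinate, horizontal coordinate)” order used in FIG. 9A.

```python
from typing import Iterable, Tuple

# (top, left, bottom, right), i.e. (vertical, horizontal) of the upper left
# corner followed by (vertical, horizontal) of the lower right corner.
Area = Tuple[int, int, int, int]


def recognized_area(char_boxes: Iterable[Area]) -> Area:
    """Smallest rectangle that includes every character box."""
    boxes = list(char_boxes)
    if not boxes:
        raise ValueError("no characters were recognized")
    return (
        min(b[0] for b in boxes),
        min(b[1] for b in boxes),
        max(b[2] for b in boxes),
        max(b[3] for b in boxes),
    )


def format_area(area: Area) -> str:
    """Render an area in the notation used in the extracted information table."""
    return f"({area[0]}, {area[1]}):({area[2]}, {area[3]})"


# Hypothetical boxes for the four characters of "Sale".
sale_boxes = [(8, 110, 45, 125), (12, 126, 45, 140), (8, 141, 45, 150), (12, 151, 45, 170)]
sale_area = recognized_area(sale_boxes)
sale_text = format_area(sale_area)
```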

For example, as illustrated in FIG. 9A, a request is received in which area 902 is specified as the specified area “(5, 100):(48, 200)”. In this case, the image processor 102 performs character recognition for that specified area, and after the character string “Sale” has been extracted, the image processor 102 obtains the recognized area in which that character string is included. Then, as illustrated in FIG. 12, the image processor 102 associates the recognized character string “Sale” with the image URL “http://xxx.601.jpg”, the language for recognition “English” and the specified area (recognized area) “(8, 110):(45, 170)”, and registers them in the extracted information table 101 a.

Moreover, as illustrated in FIG. 9C, for example, the area 904 that protrudes from the image 601 is specified as the specified area, and a request in which the specified area “(52, −20):(98, 320)” is specified is received. In this case, as illustrated in FIG. 12, the image processor 102 associates the recognized character string “Water” with the image URL “http://xxx.601.jpg” of the image 601, the language for recognition “English” and the specified area (recognized area) “(60, 120):(90, 180)”, and registers them in the extracted information table 101 a.

Furthermore, as illustrated in FIG. 9D, for example, an area 905 that cuts through the character string inside the image 601 is specified as the specified area, and a request in which the specified area “(85, 15):(210, 285)” is specified is received. In this case, as illustrated in FIG. 12, the image processor 102 associates the recognized character string “Best bang for the buck” with the image URL “http://xxx.601.jpg” of the image 601, the language for recognition “English” and the specified area (recognized area) “(110, 25):(190, 270)”, and registers them in the extracted information table 101 a.

The response transmitter 103 transmits a response, in which the recognized character string that was outputted from the image processor 102 is specified, to the terminal 200.

For example, after receiving a response in which the recognizedcharacter string “Sale” is specified, the terminal 200, as illustratedin FIG. 13A, displays a semi-transparent rectangle 931 over therecognized area of the recognized character string “Sale” for whichcharacter recognition is finished. Similarly, as illustrated in FIG. 13Bto FIG. 13D, semi-transparent rectangles 932, 933 are displayed over therecognized areas.

When a recognized character string that is associated with the ID information and the specified area (included in the parameter) that are specified in the received request is cached, the image processor 102 obtains the cached recognized character string instead of acquiring an image and performing character recognition, and outputs that obtained recognized character string.

Here, the image processor 102 determines that extracted information associated with the ID information and the specified area (included in the parameter) that are specified in the received request is cached when two conditions hold: the ID information that was associated with the recognized character string when that recognized character string was cached matches the ID information that is specified in the received request, and the specified area that was associated with the recognized character string when that recognized character string was cached overlaps the specified area that is specified in the received request.

For example, as illustrated in FIG. 13A, when a user specifies area 906,the request receiver 101 receives a request that specifies the image URLof the image 601 and the specified area “(5, 35):(52, 200)”. After thatrequest has been received, the image processor 102 references theextracted information table 101 a in FIG. 12. There is a recognizedcharacter string registered in the extracted information table 101 athat the image URL of the image 601 matches and for which the specifiedarea “(5, 35):(52, 200)” that is specified in the request overlaps thecached specified area “(8, 110):(45, 170)”. Therefore, the imageprocessor 102 omits the process of acquiring the image and performingcharacter recognition, and outputs the recognized character string“Sale” to the response transmitter 103.

The criterion for determining whether or not the specified area that is specified in the request overlaps the specified area that is cached can be set as appropriate.

For example, as illustrated in FIG. 13A and FIG. 13B, when the specified areas (areas 906, 907) that are specified in the request include all of the cached specified areas (recognized areas, rectangles 931, 932), it is determined that the areas overlap. On the other hand, as illustrated in FIG. 13C, when only part of the specified area (area 908) that is specified in the request overlaps the cached specified area (recognized area, rectangle 933), it is determined that the areas do not overlap. Alternatively, it is also possible to determine that the areas overlap when the recognized area and the specified area, which is specified in the request, overlap by at least a specified ratio of the recognized area. Moreover, as illustrated in FIG. 13D, when the cursor 701 is placed over the recognized area (rectangle 933), it is possible to determine that the areas overlap.
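The three criteria described above (full inclusion, overlap by a ratio of the recognized area, and cursor position) can be sketched as follows. The function names are hypothetical, and the coordinates used for the partial-overlap case are invented for illustration because the coordinates of area 908 are not given; the other coordinates are those of area 906 and the cached recognized areas of FIG. 12.

```python
from typing import Tuple

# (top, left, bottom, right) in the (vertical, horizontal) order of FIG. 9A.
Area = Tuple[int, int, int, int]


def includes(requested: Area, cached: Area) -> bool:
    """True when the requested area includes all of the cached area."""
    return (
        requested[0] <= cached[0]
        and requested[1] <= cached[1]
        and requested[2] >= cached[2]
        and requested[3] >= cached[3]
    )


def overlap_ratio(requested: Area, cached: Area) -> float:
    """Fraction of the cached (recognized) area covered by the requested area."""
    cached_size = (cached[2] - cached[0]) * (cached[3] - cached[1])
    height = min(requested[2], cached[2]) - max(requested[0], cached[0])
    width = min(requested[3], cached[3]) - max(requested[1], cached[1])
    if cached_size <= 0 or height <= 0 or width <= 0:
        return 0.0
    return (height * width) / cached_size


def cursor_inside(cursor: Tuple[int, int], cached: Area) -> bool:
    """True when the cursor (vertical, horizontal) lies on the cached area."""
    return cached[0] <= cursor[0] <= cached[2] and cached[1] <= cursor[1] <= cached[3]


sale_cached = (8, 110, 45, 170)        # recognized area of "Sale"
best_cached = (110, 25, 190, 270)      # recognized area of "Best bang for the buck"
area_906 = (5, 35, 52, 200)            # specified area of FIG. 13A
partial = (100, 150, 200, 300)         # hypothetical area cutting through the string

full_hit = includes(area_906, sale_cached)
partial_hit = includes(partial, best_cached)
partial_ratio = overlap_ratio(partial, best_cached)
cursor_hit = cursor_inside((150, 100), best_cached)
```

Under the inclusion criterion the hypothetical partial area is a miss, while a ratio criterion with a threshold below roughly 0.49 would treat the same area as a hit.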

(5. Operation of the Server of the Second Embodiment)

Next, the operations that are performed by each of the parts of theserver 100 of this embodiment will be explained using the flowchart inFIG. 14. After the power has been turned ON to the server 100, the CPU501 starts the control process that is illustrated in the flowchart inFIG. 14.

The request receiver 101 receives the request from the terminal 200 in which the image ID information and the specified area are specified (step S201). For example, when the user specifies the area 902 in FIG. 9A, the request receiver 101 receives the request in which the image URL “http://xxx.601.jpg” and the specified area “(5, 100):(48, 200)” are specified.

The image processor 102 then determines whether or not extractedinformation (recognized character string), which is associated with theID information (image URL) and the specified area that are specified inthe received request, is cached (step S202).

When it is determined that the recognized character string is cached(step S202; YES), the image processor 102 outputs the cached recognizedcharacter string (step S203). For example, the image processor 102references the extracted information table 101 a that is illustrated inFIG. 12, and determines that the image URL matches and that thespecified area “(5, 100):(48, 200)” that is specified in the request andthe cached specified area (recognized area) “(8, 110):(45, 170)”overlap. In this case, the image processor 102 outputs the recognizedcharacter string “Sale” that is associated with the image URL and therecognized area to the response transmitter 103.

On the other hand, when it is determined that the recognized characterstring is not cached (step S202; NO), the image processor 102 acquiresthe image of the image URL that is specified in the received request(step S205). For example, when the image URL “http://xxx.601.jpg” doesnot match, or when the specified area that is specified in the requestand the cached specified area do not overlap, the image processor 102accesses the WEB server 400 and acquires the image 601 that correspondsto the image URL “http://xxx.601.jpg”.

After the image has been acquired, the image processor 102 determinesthe language for recognition, then performs character recognition usingthat language for recognition and extracts the recognized characterstring from inside the image (step S206). For example, presuming thecharacters in the specified area (area 902) of the image 601 to beEnglish, the image processor 102 performs character recognition of thearea 902 and obtains the recognized character string “Sale”.

The image processor 102 associates and caches the recognized characterstring with the image ID information, the language for recognition andthe specified area (step S207). For example, as illustrated in FIG. 10,the image processor 102 associates the recognized character string“Sale” with the image URL “http://xxx.601.jpg”, the language forrecognition “English” and the specified area “(5, 100):(48, 200)”, andregisters them in the extracted information table 101 a. It is alsopossible to correct the specified area to the recognized area “(8,110):(45, 170)” before registering, and to register the correctedrecognized area as illustrated in FIG. 12.

The image processor 102 then outputs the extracted recognized characterstring (step S208). For example, the image processor 102 outputs therecognized character string “Sale” as the result of characterrecognition to the response transmitter 103.

The response transmitter 103 transmits the response, in which the recognized character string that was outputted in step S203 or step S208 is specified, to the terminal 200 (step S204). For example, the response transmitter 103 transmits the response, in which the recognized character string “Sale” is specified, to the terminal 200.

With this embodiment, when the result of character recognition hasalready been obtained for an area inside an image specified by a user,it is possible to omit the process of acquiring the image and performingcharacter recognition. Moreover, by translating a character string thatwas obtained as the result of character recognition, it is possible toshorten the time required for translating a character string inside animage.

(6. Basic Construction of a Server of a Third Embodiment)

A server 100 of a third embodiment makes it possible to omit the processof acquiring an image, performing character recognition and performingtranslation when translating a character string that is included insidean image, in the case where character recognition and translation havealready been performed for a specified area inside the image.

The server 100 of this embodiment, as in the second embodiment, includesa request receiver 101, an image processor 102, and a responsetransmitter 103 (FIG. 4); however, the functions of each of the partsand the information handled differ from those of the second embodiment.This will be explained in detail below.

The request receiver 101 receives a request, in which image IDinformation, a language for recognition, a specified area and conversioninformation are specified, from the terminal 200.

The conversion information, for example, is the language (hereafter,referred to as the translation target language) after translation of acharacter string that is included in an image. A recognized characterstring that has been extracted from inside an image is translated to thetranslation target language using the language for recognition.

In the following, an example is explained for the case in which a userrequests the server 100 to perform translation of the specified area(area 909 having the coordinates “(100, 20):(200, 280)”) inside theimage 601 on the WEB page 600. For example, when a user, using theterminal 200, drags the cursor 701 and selects the area 909 inside theimage 601, a pop-up 803, asking for the language for recognition and thetranslation target language, is displayed as illustrated in FIG. 15A.Then, as illustrated in FIG. 15B, when the user uses the cursor 701 andclicks on a button 804, a pull-down menu is displayed. The user selectsfrom the menu the language (language for recognition) of a characterstring inside the area 909. Similarly, the user clicks on the button 805for the translation target language and selects from the menu thelanguage (translation target language) to which the character stringinside the area 909 is to be translated. As illustrated in FIG. 15C,after the language for recognition “English” and the translation targetlanguage “Japanese” have been specified, a request, in which the imageURL “http://xxx.601.jpg” of the image 601, the specified area “(100,20):(200, 280)”, the language for recognition “English” and thetranslation target language “Japanese” have been specified, istransmitted to the server 100. The request receiver 101 of the server100 receives that request.

The image processor 102 acquires an image based on the ID informationthat was specified in the received request, and by performing characterrecognition of the specified area inside the acquired image, outputs arecognized character string that was recognized from the specified areainside the image. The image processor 102 then, by performing conversionof the outputted recognized character string using the conversioninformation (translation target language) that was specified in thereceived request, outputs a converted character string that wasconverted from the recognized character string.

For example, the image processor 102 acquires the image 601 thatcorresponds to the image URL “http://xxx.601.jpg” that was specified inthe request from the WEB server 400. Next, the image processor 102extracts the recognized character string “Best bang for the buck” fromthe specified area “(100, 20):(200, 280)” using the language forrecognition “English”. The image processor 102 then converts thatrecognized character string to the translation target language“Japanese”, and outputs the converted character string “

(Best bang for the buck)” to the response transmitter 103.

After the converted character string has been extracted, the imageprocessor 102 associates the converted character string with the imageID information, the specified area, the language for recognition, therecognized character string and the conversion information, and cachesthat converted character string. In this embodiment, the specified areathat is cached is the recognized area.

For example, as illustrated in FIG. 16, the image processor 102associates the converted character string “

” with the image URL “http://xxx.601.jpg” of the image 601, the language for recognition “English”, the specified area “(110, 25):(190, 270)”, the translation target language “Japanese” and the recognized character string “Best bang for the buck”, and registers them in the extracted information table 101 a. The information to be registered is not limited to this combination, and, for example, it is also possible to not register the translation target language and the converted character string.

When the converted character string has been cached for the IDinformation (image URL), the specified area, the language forrecognition and the conversion information (translation target language)that are specified in the received request, the image processor 102 thenobtains the cached converted character string instead of performing theconversion process, and outputs the obtained converted character string.

For example, the user specifies the area 910 (the coordinates are “(90,0):(200, 290)”) as illustrated in FIG. 17A, and in a pop-up 806, selectsthe language for recognition “English” and the translation targetlanguage “Japanese”. In this case, the request receiver 101 receives arequest in which the image URL of the image 601, the specified area“(90, 0):(200, 290)”, the language for recognition “English” and thetranslation target language “Japanese” are specified. The imageprocessor 102 references the extracted information table 101 a in FIG.16 and determines that the image URL, the language for recognition andthe translation target language that are specified in the request matchthe cached information for these, and determines that the specified area“(90, 0):(200, 290)” that was specified in the request, and thespecified area “(110, 25):(190, 270)” that is cached overlap. Therefore,the image processor 102 omits the processes of acquiring the image,performing character recognition and performing conversion, and outputsthe converted character string “

” to the response transmitter 103.

When the image URL and the language for recognition match, and thespecified area that is specified in the request and the specified areathat is cached overlap, the image processor 102 can omit only theprocesses of acquiring the image and performing character recognition,and use the recognized character string that is registered in theextracted information table 101 a to convert to the translation targetlanguage and obtain the converted character string. For example, therequest receiver 101 receives a request in which the image URL of theimage 601, the specified area “(90, 0):(200, 290)”, the language forrecognition “English” and the translation target language “German” arespecified. In this case, a corresponding converted character string isnot registered in the extracted information table 101 a in FIG. 16, sothat the image processor 102 performs the conversion processing oftranslating the recognized character string “Best bang for the buck” toGerman, and outputs the converted character string.

The response transmitter 103 transmits a response, in which the converted character string that was outputted from the image processor 102 is specified, to the terminal 200.

For example, when the terminal 200 receives the response in which theconverted character string “

” is specified, then as illustrated in FIG. 17C, a pop-up 807, in whichthe converted character string “

” is listed, is displayed on the WEB page 600. Alternatively, asillustrated in FIG. 17D, it is possible to display a rectangle 941, inwhich the converted character string is listed, over the recognizedarea.

(7. Operation of the Server of the Third Embodiment)

Next, the operations that are performed by each of the parts of theserver 100 of this embodiment will be explained using the flowchart inFIG. 18. After the power has been turned ON to the server 100, the CPU501 starts the control process illustrated in the flowchart in FIG. 18.

The request receiver 101 receives the request from the terminal 200 inwhich the ID information (image URL) of the image, the specified area,the language for recognition, and the conversion information(translation target language) are specified (step S301). For example,the user specifies the area 909 in FIG. 15A, and as illustrated in FIG.15C, by specifying the language for recognition and the translationtarget language in the pop-up 803, the request receiver 101 receives arequest in which the image URL “http://xxx.601.jpg”, the specified area“(100, 20):(200, 280)”, the language for recognition “English” and thetranslation target language “Japanese” are specified.

The image processor 102 then determines whether or not a converted character string that is associated with the ID information, the specified area, the language for recognition and the translation target language is cached (step S302).

When the image processor 102 determines that the converted characterstring is cached (step S302; YES), the image processor 102 outputs theconverted character string that is cached (step S303). For example, theimage processor 102 references the extracted information table 101 aillustrated in FIG. 16, and determines that the image URL, the languagefor recognition and the translation target language match, anddetermines that the specified area “(100, 20):(200, 280)” that wasspecified in the request and the specified area “(110, 25):(190, 270)”that is cached overlap. In this case, the image processor 102 outputsthe converted character string “

” that is associated with the image URL, the specified area, thelanguage for recognition and the translation target language to theresponse transmitter 103.

On the other hand, when it is determined that the converted character string is not cached (step S302; NO), the image processor 102 determines whether or not a recognized character string that is associated with the image URL, the specified area and the language for recognition is cached (step S305).

When it is determined that the recognized character string is cached(step S305; YES), the image processor 102 converts the cached recognizedcharacter string to the translation target language and obtains theconverted character string (step S306). For example, when thetranslation target language “Japanese” and the converted characterstring “

” are not registered in the extracted information table 101 a in FIG. 16, the image processor 102 converts the recognized character string “Best bang for the buck” to the translation target language “Japanese” and obtains the converted character string “

”.

On the other hand, when the recognized character string is not cached (step S305; NO), the image processor 102 acquires the image of the image URL that is specified in the received request (step S309). Then, the image processor 102 performs character recognition for the specified area inside that image and extracts the recognized character string (step S310). For example, when the image URL “http://xxx.601.jpg” does not match, or when the specified area that is specified in the request and the specified area that is cached do not overlap, the image processor 102 accesses the WEB server 400 and acquires the image 601 that corresponds to the image URL “http://xxx.601.jpg”. The image processor 102 then performs character recognition for the specified area using the language for recognition “English” that is specified in the request, and extracts the recognized character string “Best bang for the buck”.

In step S310, after the recognized character string has been extracted,the image processor 102 performs conversion on that recognized characterstring (step S306).

The image processor 102 associates and caches the converted characterstring with the image ID information, the language for recognition, thetranslation target language, the specified area and the recognizedcharacter string (step S307). The image processor 102 then outputs theacquired converted character string to the response transmitter 103(step S308). For example, as illustrated in FIG. 16, the image processor102 associates the converted character string “

” with the image URL “http://xxx.601.jpg”, the language for recognition“English”, the translation target language “Japanese”, the specifiedarea (recognized area) “(110, 25):(190, 270)” and the recognizedcharacter string “Best bang for the buck”, and registers them in theextracted information table 101 a. The image processor 102 then outputsthe converted character string “

” to the response transmitter 103.

Next, the response transmitter 103 transmits the response, in which the converted character string that was outputted in step S303 or step S308 is specified, to the terminal 200 (step S304). For example, the response transmitter 103 transmits the response, in which the converted character string “

” is specified, to the terminal 200.

With this embodiment, when the result of conversion (translation result)of an area inside an image that was specified by a user has already beenobtained, it is possible to omit the processes of acquiring the image,performing character recognition and performing conversion. As a result,the time required for translating a character string inside an image canbe shortened.
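The two-stage cache check of FIG. 18 can be sketched as follows. This is a simplified illustration under several assumptions: each table entry keeps its converted character strings in a dictionary keyed by the translation target language (FIG. 16 shows one row per entry instead), the overlap test is the full-inclusion criterion, and `fetch_image`, `recognize` and `translate` are hypothetical stand-ins. The translate stand-in does not perform real translation; it only tags the string with the target language.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

Area = Tuple[int, int, int, int]  # (top, left, bottom, right)


@dataclass
class Entry:
    image_url: str
    language: str                  # language for recognition
    area: Area                     # cached specified area (recognized area)
    recognized: str                # recognized character string
    converted: Dict[str, str] = field(default_factory=dict)  # target language -> string


def includes(requested: Area, cached: Area) -> bool:
    return (requested[0] <= cached[0] and requested[1] <= cached[1]
            and requested[2] >= cached[2] and requested[3] >= cached[3])


def find_entry(table: List[Entry], url: str, language: str, area: Area) -> Optional[Entry]:
    for entry in table:
        if entry.image_url == url and entry.language == language and includes(area, entry.area):
            return entry
    return None


def handle_request(table: List[Entry], url: str, area: Area, language: str, target: str,
                   fetch_image: Callable, recognize: Callable, translate: Callable) -> str:
    entry = find_entry(table, url, language, area)
    if entry is not None and target in entry.converted:          # step S302: YES
        return entry.converted[target]                           # step S303
    if entry is None:                                            # step S305: NO
        image = fetch_image(url)                                 # step S309
        text, found_area = recognize(image, area, language)      # step S310
        entry = Entry(url, language, found_area, text)
        table.append(entry)
    converted = translate(entry.recognized, language, target)    # step S306
    entry.converted[target] = converted                          # step S307
    return converted                                             # step S308


log: List[str] = []


def fake_fetch(url):
    log.append("fetch")
    return b""


def fake_recognize(image, area, language):
    log.append("recognize")
    return "Best bang for the buck", (110, 25, 190, 270)


def fake_translate(text, source, target):
    log.append("translate")
    return f"[{target}] {text}"


table: List[Entry] = []
url = "http://xxx.601.jpg"
r1 = handle_request(table, url, (100, 20, 200, 280), "English", "Japanese",
                    fake_fetch, fake_recognize, fake_translate)
r2 = handle_request(table, url, (90, 0, 200, 290), "English", "Japanese",
                    fake_fetch, fake_recognize, fake_translate)
r3 = handle_request(table, url, (90, 0, 200, 290), "English", "German",
                    fake_fetch, fake_recognize, fake_translate)
```

In this sketch the second request is served entirely from the cache, and the third request (translation target language “German”) reuses the cached recognized character string and performs only the conversion.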

(8. Basic Construction of a Server of a Fourth Embodiment)

A server 100 of a fourth embodiment informs a user whether or not processing for character recognition of an image has been completed.

The server 100 of this embodiment, as illustrated in FIG. 19, includes arequest receiver 101, an image processor 102, a response transmitter103, an inquiry receiver 104 and an answer transmitter 105. The requestreceiver 101, the image processor 102 and the response transmitter 103of this embodiment have the same functions as in the first embodiment.In the following, the inquiry receiver 104 and answer transmitter 105that have different functions will be explained.

The inquiry receiver 104 receives an inquiry from the terminal 200 inwhich image ID information is specified.

For example, as illustrated in FIG. 20A, when the user places the cursor701 inside the image 601 (mouse over), the terminal 200 transmits aninquiry, in which the image URL of the image 601 is specified, to theserver 100. The inquiry receiver 104 of the server 100 receives thatinquiry.

In this embodiment, the CPU 501 and the NIC 504 working togetherfunction as the inquiry receiver 104. The same is also true in the fifthand sixth embodiments described later.

When the extracted information that is associated with the ID information that was specified in the received inquiry is not cached, the answer transmitter 105 transmits an answer, in which a message is specified that prompts transmission of a request, in which the image ID information and the parameter of the image processing are specified, to the terminal 200. When the extracted information that is associated with the ID information that is specified in the received inquiry is cached, the answer transmitter 105 transmits an answer, in which that extracted information is specified, to the terminal 200.

Here, the parameter of image processing is the language for recognition for character recognition, and the extracted information is a recognized character string. The extracted information table 101 a such as illustrated in FIG. 6 is stored in the server 100. The character string “Sale Water Best bang for the buck” that corresponds to the image URL “http://xxx.601.jpg” of the image 601 that is specified in the inquiry is registered in that extracted information table 101 a. Therefore, the answer transmitter 105 transmits an answer, in which the recognized character string is specified, to the terminal 200. After the terminal 200 receives that answer, a pop-up 809 that includes the recognized character string is displayed on the WEB page that is displayed on the terminal 200 as illustrated in FIG. 20A.

When the recognized character string is cached, it is also possible for a message, which indicates that the recognized character string was obtained, to be specified in the answer that is transmitted. After the terminal 200 receives the answer, a pop-up 810 that includes the message indicating that the recognized character string was obtained is displayed as illustrated in FIG. 20B.

On the other hand, when the recognized character string that corresponds to the image URL of the image 601 is not cached in the server 100, the answer transmitter 105 transmits an answer, in which a message is specified that prompts transmission of a request in which the image URL of the image 601 and the language for recognition are specified, to the terminal 200. When the terminal 200 receives that answer, a pop-up 801 that asks for input of the language for recognition is displayed on the WEB page that is shown on the terminal 200 as illustrated in FIG. 5A.

In this embodiment, the CPU 501 and the NIC 504 work together to function as the answer transmitter 105. The same is also true in the fifth and sixth embodiments that are described later.

(9. Operation of the Server of the Fourth Embodiment)

Next, the operations that are performed by each of the parts of the server 100 of this embodiment will be explained using the flowchart in FIG. 21. After the power has been turned ON to the server 100, the CPU 501 starts the control process that is illustrated in the flowchart in FIG. 21.

The inquiry receiver 104 receives the inquiry from the terminal 200 in which image ID information is specified (step S401). For example, the inquiry receiver 104 receives an inquiry in which the image URL of the image 601 is specified.

The answer transmitter 105 determines whether or not the recognized character string that is associated with the ID information that was specified in the inquiry is cached (step S402).

When it is determined that the recognized character string is cached (step S402; YES), the answer transmitter 105 transmits an answer, in which the cached recognized character string is specified, to the terminal 200 (step S403). For example, when the recognized character string “Sale Water Best bang for the buck” that is associated with the image URL of the image 601 is registered in the extracted information table 101 a (FIG. 6) that is stored in the server 100, the answer transmitter 105 transmits an answer, in which that character string is specified, to the terminal 200.

On the other hand, when it is determined that the recognized character string is not cached (step S402; NO), the answer transmitter 105 transmits an answer, in which a message is specified prompting for specification of the language for recognition, to the terminal 200 (step S404). For example, when the recognized character string that corresponds to the image URL of the image 601 is not registered in the extracted information table 101 a that is stored in the server 100, the answer transmitter 105 transmits an answer, in which a message is specified prompting transmission of a request in which the image URL of image 601 and the language for recognition are specified, to the terminal 200.

After that, when the request, in which the image URL of the image 601 and the language for recognition are specified, is received from the terminal 200, the server 100 starts, for example, the control process illustrated in the flowchart in FIG. 8.

With this embodiment, by simply placing a mouse over an image or the like, it is possible to notify a user whether or not processing for character recognition of that image has been completed.
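Steps S401 to S404 can be sketched as a single lookup. The following is a minimal illustration, assuming a dictionary keyed by image URL as a stand-in for the extracted information table 101 a and an answer represented as a plain dictionary. The field names `status`, `recognized` and `message`, and the message wording, are hypothetical and are not taken from the specification.

```python
from typing import Dict

# Hypothetical stand-in for the extracted information table 101 a (FIG. 6),
# keyed by image URL.
recognized_cache: Dict[str, str] = {
    "http://xxx.601.jpg": "Sale Water Best bang for the buck",
}


def answer_inquiry(image_url: str) -> Dict[str, str]:
    """Build the answer for an inquiry in which only the image URL is specified."""
    recognized = recognized_cache.get(image_url)  # step S402
    if recognized is not None:
        # Step S403: the cached recognized character string is specified in the answer.
        return {"status": "cached", "recognized": recognized}
    # Step S404: prompt the terminal to send a request with a language for recognition.
    return {
        "status": "prompt_request",
        "message": "Please specify the language for recognition.",
    }
```

A terminal receiving the `cached` answer would show a pop-up such as the pop-up 809, while the `prompt_request` answer would lead to a pop-up such as the pop-up 801.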

(10. Basic Construction of a Server of a Fifth Embodiment)

A server 100 of a fifth embodiment notifies a user whether or not processing for character recognition of an area inside an image has been completed.

The server 100 of this embodiment, as illustrated in FIG. 19, includes a request receiver 101, an image processor 102, a response transmitter 103, an inquiry receiver 104 and an answer transmitter 105. The request receiver 101, the image processor 102 and the response transmitter 103 have the same functions as in the second embodiment. In the following, the inquiry receiver 104 and the answer transmitter 105 that have different functions will be explained.

The inquiry receiver 104 receives an inquiry, in which image ID information and a position inside that image are specified, from the terminal 200.

For example, as illustrated in FIG. 22A, when a user places the cursor 701 inside the image 601 (mouse over), the terminal 200 transmits an inquiry, in which the image URL of the image 601 and the coordinates (“(75:175)”) of the position of the cursor 701 are specified, to the server 100. The inquiry receiver 104 of the server 100 receives that inquiry.

When a recognized character string, which is associated with the ID information that is specified in the received inquiry, and with the specified area that includes the position that is specified in the received inquiry, is cached by the image processor 102, the answer transmitter 105 transmits an answer, in which the cached recognized character string is specified, to the terminal 200.

For example, it is presumed that the extracted information table 101 a illustrated in FIG. 12 is stored in the server 100. As illustrated in FIG. 22A, the coordinates “(75:175)” of the position that was specified in the inquiry are included in the recognized area 922. The recognized character string “Water”, which corresponds to the image URL “http://xxx.601.jpg” of the image 601 that was specified in the inquiry and the specified area (recognized area 922) that includes the coordinates of the position that was specified in the inquiry, is registered in that extracted information table 101 a. Therefore, the answer transmitter 105 transmits an answer, in which the recognized character string is specified, to the terminal 200. When the terminal 200 receives that answer, a pop-up 811 that includes the recognized character string “Water” is displayed on the WEB page that is shown on the terminal 200 as illustrated in FIG. 22A.

It is also possible to specify a message prompting transmission of a request in which the translation target language is specified in the answer that is transmitted when the recognized character string is cached. After the terminal 200 receives the answer, a pop-up 812 requesting that the translation target language be specified is displayed as illustrated in FIG. 22B.

On the other hand, when the recognized character string, which corresponds to the image URL of the image 601, and to the specified area in which the specified position is included, is not cached in the server 100, the answer transmitter 105 transmits an answer, in which a message is specified prompting transmission of a request in which the image URL of the image 601, the position inside the image and the language for recognition are specified for that specified area, to the terminal 200. When the terminal 200 receives that answer, a pop-up 813 requesting input of the language for recognition is displayed on the WEB page that is shown on the terminal 200 as illustrated in FIG. 22C.

(11. Operation of the Server of the Fifth Embodiment)

Next, the operations that are performed by each of the parts of the server 100 of this embodiment will be explained using the flowchart in FIG. 23. After the power has been turned ON to the server 100, the CPU 501 starts the control process that is illustrated in the flowchart in FIG. 23.

The inquiry receiver 104 receives an inquiry from the terminal 200 in which image ID information and a position inside that image are specified (step S501). For example, the inquiry receiver 104 receives an inquiry in which the image URL of the image 601 and the position “(75:175)” are specified.

The answer transmitter 105 determines whether or not a recognized character string, which is associated with the ID information and the specified area that includes the position that were specified in the inquiry, is cached (step S502).

When it is determined that the recognized character string is cached (step S502; YES), the answer transmitter 105 transmits an answer, in which the cached recognized character string is specified, to the terminal 200 (step S503). For example, when the recognized character string “Water” that corresponds to the image URL of the image 601 and the specified area that includes the specified position “(75:175)” is registered in the extracted information table 101 a that is stored in the server 100 (FIG. 12), the answer transmitter 105 transmits an answer, in which that recognized character string is specified, to the terminal 200.

On the other hand, when it is determined that the recognized character string is not cached (step S502; NO), the answer transmitter 105 transmits an answer, in which a message is specified prompting specification of the language for recognition for the specified area of that image, to the terminal 200 (step S504). For example, when a recognized character string that corresponds to the image URL of the image 601 and the specified area that includes the specified position is not registered in the extracted information table 101 a that is stored in the server 100, the answer transmitter 105 transmits an answer, in which a message is specified that prompts transmission of a request in which the image URL of the image 601, the specified area and the language for recognition are specified, to the terminal 200.

With this embodiment, by simply placing the mouse over an area inside an image or the like, it is possible to notify a user whether or not character recognition has been completed for that area.
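The determination in step S502 amounts to finding a cached area of the same image that contains the specified position. The sketch below assumes rectangular areas given by two corner points and a list of entries as a stand-in for the extracted information table 101 a. The class and function names are hypothetical, and the corner coordinates used for the recognized area 922 in the usage note are invented for illustration, since only the position “(75:175)” is given in the text.

```python
from typing import List, NamedTuple, Optional, Tuple


class CachedArea(NamedTuple):
    """One row of a hypothetical stand-in for the extracted information table 101 a."""
    image_url: str
    top_left: Tuple[int, int]
    bottom_right: Tuple[int, int]
    recognized: str


def find_recognized(
    table: List[CachedArea], image_url: str, position: Tuple[int, int]
) -> Optional[str]:
    """Return the cached string whose area contains the position, or None (step S502)."""
    x, y = position
    for entry in table:
        (x1, y1), (x2, y2) = entry.top_left, entry.bottom_right
        if entry.image_url == image_url and x1 <= x <= x2 and y1 <= y <= y2:
            return entry.recognized  # step S503: answer with the cached string
    return None  # step S504: the caller prompts for the language for recognition
```

For instance, with a table containing `CachedArea("http://xxx.601.jpg", (60, 150), (100, 200), "Water")`, where the corners are assumed values, an inquiry for position `(75, 175)` yields “Water”, while a position outside every cached area yields `None`.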

(12. Basic Construction of a Server of a Sixth Embodiment)

A server 100 of a sixth embodiment notifies a user whether or not conversion processing has been completed for an area inside an image.

The server 100 of this embodiment, as illustrated in FIG. 19, includes a request receiver 101, an image processor 102, a response transmitter 103, an inquiry receiver 104 and an answer transmitter 105. The request receiver 101, the image processor 102 and the response transmitter 103 have the same functions as in the third embodiment. In the following, the inquiry receiver 104 and the answer transmitter 105 that have different functions will be explained.

The inquiry receiver 104 receives an inquiry from the terminal 200 in which image ID information, a position inside that image and conversion information are specified.

For example, as illustrated in FIG. 22A, when a user places the cursor 701 inside the image 601, the pop-up 812 is displayed as illustrated in FIG. 22B asking that the translation target language (conversion information) be specified. After the user selects an arbitrary translation target language (for example, Spanish), the terminal 200 transmits an inquiry, in which the image URL of the image 601, the coordinates “(75:175)” of the cursor 701 position and the translation target language “Spanish” are specified, to the server 100. The inquiry receiver 104 of the server 100 receives that inquiry.

When the converted character string that is associated with the ID information and the conversion information that are specified in the received inquiry, and that is associated with a specified area that includes the position that is specified in the received inquiry, is cached by the image processor 102, the answer transmitter 105 transmits an answer in which the cached converted character string is specified.

For example, it is presumed that an extracted information table 101 a as illustrated in FIG. 24 is stored in the server 100. The image URL “http://xxx.601.jpg” of the image 601 that was specified in the inquiry, the specified area (recognized area 922) that includes the coordinates of the position that was specified in the inquiry, and the converted character string “Agua” that corresponds to the translation target language “Spanish” are registered in the extracted information table 101 a. Therefore, the answer transmitter 105 transmits an answer, in which the converted character string is specified, to the terminal 200. When the terminal 200 receives that answer, a pop-up 814 that includes the converted character string “Agua” is displayed on the WEB page that is shown on the terminal 200 as illustrated in FIG. 25A.

On the other hand, when the converted character string, which corresponds to the image URL of the image 601, the specified area that includes the specified position, and the translation target language, is not cached in the server 100, the answer transmitter 105 transmits an answer, in which a message that indicates that the converted character string is not cached is specified, to the terminal 200. When the terminal 200 receives that answer, a pop-up 815 that includes the message indicating that the converted character string is not cached is displayed on the WEB page that is shown on the terminal 200 such as illustrated in FIG. 25B.

(13. Operation of the Server of the Sixth Embodiment)

Next, the operations that are performed by each part of the server 100 of this embodiment will be explained using the flowchart in FIG. 26. After the power has been turned ON to the server 100, the CPU 501 starts the control process that is illustrated in the flowchart in FIG. 26.

The inquiry receiver 104 receives the inquiry, in which the image ID information, the position inside that image, and the conversion information are specified, from the terminal 200 (step S601). For example, the inquiry receiver 104 receives the inquiry in which the image URL of the image 601, the position “(75:175)”, and the translation target language “Spanish” are specified.

The answer transmitter 105 determines whether or not the converted character string, which is associated with the ID information, the specified area that includes the position and the translation target language that were specified in the inquiry, is cached (step S602).

When it is determined that the converted character string is cached (step S602; YES), the answer transmitter 105 transmits the answer, in which the cached converted character string is specified, to the terminal 200 (step S603). For example, when the converted character string “Agua”, which corresponds to the image URL of the image 601, the specified area that includes the specified position “(75:175)” and the translation target language “Spanish”, is registered in the extracted information table 101 a that is stored in the server 100 (FIG. 24), the answer transmitter 105 transmits an answer, in which that converted character string is specified, to the terminal 200.

On the other hand, when it is determined that the converted character string is not cached (step S602; NO), the answer transmitter 105 transmits the answer, in which a message is specified indicating that the converted character string is not cached, to the terminal 200 (step S604). For example, when the converted character string, which corresponds to the image URL of the image 601, the specified area that includes the specified position and the translation target language “Spanish”, is not registered in the extracted information table 101 a that is stored in the server 100, the answer transmitter 105 transmits the answer, in which the message is specified indicating that the converted character string is not cached, to the terminal 200.

In this embodiment, it is also possible for the inquiry receiver 104 to receive an inquiry in which the image URL and the position are specified, and for the answer transmitter 105 to transmit an answer in which the converted character string, which corresponds to that image URL and the specified area that includes the position, is specified.

For example, when an inquiry is received in which the image URL of the image 601 and the specified position “(75:175)” are specified, the answer transmitter 105 references the extracted information table 101 a in FIG. 24, and acquires the converted character strings “

(Water)”, “Agua”, “L'eau” that correspond to the image URL and the specified area that includes the position. The answer transmitter 105 then transmits the answer, in which all of these converted character strings are specified, to the terminal 200. After the terminal 200 receives that answer, a pop-up 816 such as illustrated in FIG. 25C is displayed on the terminal 200. Alternatively, the answer transmitter 105 can select the converted character string that corresponds to the translation target language that has been specified with the highest frequency in requests up to that point, and transmit an answer in which that converted character string is specified. Moreover, it is also possible for the answer transmitter 105 to select one at random from the converted character strings that are registered in the extracted information table 101 a, and transmit an answer in which that converted character string is specified.

With this embodiment, by simply placing the mouse over an area inside an image or the like, it is possible to notify a user whether or not conversion processing (translation process) has been completed for that area.
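The lookups of this embodiment, including the variation in which no translation target language is specified in the inquiry, can be sketched as follows. The sketch assumes rows of (image URL, area, translation target language, converted character string) as a stand-in for the extracted information table 101 a in FIG. 24. All function names are hypothetical, and the area coordinates in the usage note are invented for illustration. Only the "highest frequency" selection is shown; random selection could be done in the same place.

```python
from collections import Counter
from typing import Dict, List, Optional, Tuple

Area = Tuple[Tuple[int, int], Tuple[int, int]]
# Row layout (assumed): image URL, area, translation target language, converted string.
Row = Tuple[str, Area, str, str]


def contains(area: Area, position: Tuple[int, int]) -> bool:
    """True when the position lies inside the rectangular area (borders included)."""
    (x1, y1), (x2, y2) = area
    x, y = position
    return x1 <= x <= x2 and y1 <= y <= y2


def converted_strings_at(
    table: List[Row], image_url: str, position: Tuple[int, int]
) -> Dict[str, str]:
    """Map each cached target language to its converted string for the area under the position."""
    return {
        lang: text
        for url, area, lang, text in table
        if url == image_url and contains(area, position)
    }


def answer_converted(
    table: List[Row], image_url: str, position: Tuple[int, int], target_lang: str
) -> Optional[str]:
    """Steps S602 to S604: the cached converted string, or None when it is not cached."""
    return converted_strings_at(table, image_url, position).get(target_lang)


def most_requested(candidates: Dict[str, str], request_history: List[str]) -> Optional[str]:
    """Pick the candidate whose target language appeared most often in past requests."""
    counts = Counter(lang for lang in request_history if lang in candidates)
    if not counts:
        return None
    return candidates[counts.most_common(1)[0][0]]
```

With rows for “Spanish” (“Agua”) and “French” (“L'eau”) registered for an assumed area around the position `(75, 175)`, `answer_converted` returns “Agua” for Spanish and `None` for a language that is not cached, and `converted_strings_at` returns every cached candidate for display in a pop-up such as the pop-up 816.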

(14. Basic Construction of a Terminal of a Seventh Embodiment)

A program of a seventh embodiment functions so that results of character recognition or conversion processing that was performed by a server are displayed on a terminal that is capable of communicating with the server.

As illustrated in FIG. 27, the terminal 200 on which the program of this embodiment operates includes a display 201, a position detector 202, an inquiry transmitter 203, an answer receiver 204, an area detector 205, a setter 206, a request transmitter 207 and a response receiver 208.

The display 201 displays on a screen an image that was identified according to ID information.

For example, the display 201, as illustrated in FIG. 22A, displays the image 601 that is specified by the image URL “http://xxx.601.jpg”.

In this embodiment, the CPU 501 and the image processor 505 work together to function as the display 201.

The position detector 202 detects a selection of a position inside the image that is displayed on the screen.

For example, when the cursor 701 is placed inside the image, the position detector 202, as illustrated in FIG. 22A, detects the coordinates of the position of that cursor 701.

In this embodiment, the CPU 501 and the image processor 505 work together to function as the position detector 202.

After the selection of the position has been detected, the inquiry transmitter 203 transmits an inquiry, in which the ID information and the selected position are specified, to the server 100.

For example, in the case illustrated in FIG. 22A, the inquiry transmitter 203 transmits an inquiry, in which the image URL of the image 601 and the coordinates “(75:175)” of the position are specified, to the server 100.

In this embodiment, the CPU 501 and the NIC 504 work together to function as the inquiry transmitter 203.

The answer receiver 204 receives an answer from the server 100. The display 201 then further displays the extracted information that is specified in the received answer on the screen.

For example, the answer receiver 204 receives an answer in which the recognized character string “Water” is specified. In this case, as illustrated in FIG. 22A, the display 201 displays the pop-up 811 that includes that recognized character string.

In this embodiment, the CPU 501 and the NIC 504 work together to function as the answer receiver 204.

The area detector 205 detects a selection of an area inside the image that is displayed on the screen.

For example, as illustrated in FIG. 9A, when the cursor 701 is dragged inside the image, the area detector 205 detects the coordinates of the dragged area (area 902).

In this embodiment, the CPU 501 and the image processor 505 work together to function as the area detector 205.

The setter 206 prompts setting of parameters for image processing.

For example, the setter 206 prompts setting of the language for recognition that is used in character recognition.

In this embodiment, the CPU 501 functions as the setter 206.

After the selection of the area has been detected, the request transmitter 207 transmits a request, in which the ID information, the set parameter and the selected area are specified, to the server 100.

For example, as illustrated in FIG. 9A, when the area is specified and the coordinates of that area are detected, the request transmitter 207 transmits a request, in which the image URL “http://xxx.601.jpg”, the language for recognition “English” and the coordinates “(5, 100):(48, 200)” of the selected area are specified, to the server 100.

In this embodiment, the CPU 501 and the NIC 504 work together to function as the request transmitter 207.

The response receiver 208 receives a response from the server 100. The display 201 then further displays on the screen extracted information that is specified in the received response.

For example, the response receiver 208 receives a response in which the recognized character string “Sale” is specified. In this case, the display 201, as illustrated in FIG. 13A for example, further displays the semi-transparent rectangle 931 over the recognized character string “Sale”.

In this embodiment, the CPU 501 and the NIC 504 work together to function as the response receiver 208.

(15. Operation of the Terminal of the Seventh Embodiment)

Next, the operations of each of the parts of the terminal 200 of this embodiment will be explained using the flowchart in FIG. 28. After the power has been turned ON to the terminal 200, the CPU 501 starts the control process illustrated in FIG. 28.

The display 201 displays an image on the screen that was identified from ID information (step S701). The position detector 202 determines whether or not the selection of the position inside the image that is displayed on the screen has been detected (step S702).

For example, the display 201 displays the image 601 for which the image URL “http://xxx.601.jpg” is specified, and when the cursor 701 is placed over that image 601, the position detector 202 detects the coordinates “(75:175)” of the cursor 701 (FIG. 22A).

When the position detector 202 determines that selection of the position has been detected (step S702; YES), the inquiry transmitter 203 transmits an inquiry, in which ID information and the selected position are specified, to the server 100 (step S703). The answer receiver 204 then receives an answer from the server 100 that corresponds to the inquiry (step S704). The display 201 further displays on the screen extracted information that is specified by the received answer (step S705).

For example, the inquiry transmitter 203 transmits an inquiry, in which the image URL of the image 601 and the coordinates “(75:175)” of the position are specified, to the server 100. The answer receiver 204 then receives an answer in which the recognized character string “Water”, which was obtained by character recognition for the area that includes the coordinates of the position, is specified. After receiving that answer, the display 201, as illustrated in FIG. 22A, displays the pop-up 811 in which the recognized character string is included.

On the other hand, when the position detector 202 did not determine that the selection of the position was detected (step S702; NO), the area detector 205 determines whether or not the selection of the area inside the image that is displayed on the screen was detected (step S706).

For example, when the user drags the cursor 701 inside the image as illustrated in FIG. 9A, the area detector 205 detects the coordinates “(5, 100):(48, 200)” of the area (area 902) where the cursor 701 was dragged.

When the area detector 205 determines that the selection of the area was detected (step S706; YES), the setter 206 prompts setting the parameter for image processing (step S707). The request transmitter 207 transmits a request, in which the ID information, the set parameter and the selected area are specified, to the server 100 (step S708). The response receiver 208 receives a response from the server 100 that corresponds to the request (step S709). The display 201 further displays on the screen the extracted information that is specified in the received response (step S710).

For example, the setter 206 prompts setting of the language for recognition that will be used in character recognition, and after the language for recognition “English” has been set, the request transmitter 207 transmits a request, in which the image URL of the image 601, the language for recognition “English” and the coordinates “(5, 100):(48, 200)” of the area are specified, to the server 100. The response receiver 208 then receives a response in which the recognized character string “Sale”, which was obtained by character recognition for an area that overlaps with the area, is specified. After that response has been received, the display 201, as illustrated in FIG. 13A for example, displays the semi-transparent rectangle 931 over the recognized character string “Sale”.

On the other hand, when the area detector 205 did not determine that the selection of the area was detected (step S706; NO), or when processing of step S705 or step S710 is complete, processing returns to step S701.
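The branching of steps S702 to S710 can be summarized as a small dispatch function. This is only a sketch: the specification states that the program uses script languages executed by a browser, whereas Python is used here for brevity. The event dictionary layout, the `StubServer` class with its `inquire` and `request` methods, and the `ask_language` callback are all hypothetical stand-ins for the position detector 202, the area detector 205, the server 100 and the setter 206.

```python
from typing import Callable, Dict, Optional


class StubServer:
    """Hypothetical stand-in for the server 100 that returns canned results."""

    def inquire(self, image_url: str, position) -> str:
        return "Water"  # answer to an inquiry (steps S703 and S704)

    def request(self, image_url: str, language: str, area) -> str:
        return "Sale"  # response to a request (steps S708 and S709)


def handle_event(
    event: Dict, image_url: str, server, ask_language: Callable[[], str]
) -> Optional[str]:
    """Return the extracted information to display, or None when nothing was selected."""
    if event.get("type") == "hover":  # step S702; YES
        # Steps S703 to S705: send an inquiry and show what the answer specifies.
        return server.inquire(image_url, event["position"])
    if event.get("type") == "drag":  # step S706; YES
        language = ask_language()  # step S707: prompt for the parameter
        # Steps S708 to S710: send a request and show what the response specifies.
        return server.request(image_url, language, event["area"])
    return None  # step S706; NO: control returns to step S701
```

Calling `handle_event` repeatedly from an event loop corresponds to the return to step S701 after each pass.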

In this embodiment, it is also possible to further specify area information, which indicates from which area of an image the extracted information was extracted, in the answer that is received by the answer receiver 204. Moreover, the program of this embodiment can be such that image ID information, extracted information and area information are associated and cached in the terminal 200. In this case, when the user places the mouse over an area that is included in the cached area information for an image having the same ID information as that image, the display 201 displays the extracted information that corresponds to that area information on the screen.

Moreover, it is possible for the response that is received by the response receiver 208 to specify area information that indicates from which area of an image the extracted information was extracted. The program of this embodiment can cause the image ID information, the extracted information and the area information to be associated and cached in the terminal 200. In this case, when the user selects an area of an image that has the same ID information as that image so as to overlap the area indicated in the cached area information, the display 201 displays on the screen the extracted information that corresponds to that area information. Determining whether or not the areas overlap can be performed, for example, in the same way as in the second embodiment.
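A terminal-side cache of this kind might look like the following sketch. It assumes rectangular areas given by two corner points and uses a standard axis-aligned rectangle intersection test for the overlap determination, which is an assumption, because the exact determination of the second embodiment is not reproduced in this section. The class and method names are hypothetical.

```python
from typing import List, Optional, Tuple

Area = Tuple[Tuple[int, int], Tuple[int, int]]


def areas_overlap(a: Area, b: Area) -> bool:
    """Axis-aligned rectangle intersection test (touching borders count as overlap)."""
    (ax1, ay1), (ax2, ay2) = a
    (bx1, by1), (bx2, by2) = b
    return ax1 <= bx2 and bx1 <= ax2 and ay1 <= by2 and by1 <= ay2


class TerminalCache:
    """Associates image ID information, area information and extracted information."""

    def __init__(self) -> None:
        self._entries: List[Tuple[str, Area, str]] = []

    def store(self, image_url: str, area: Area, extracted: str) -> None:
        """Cache what a response from the server specified."""
        self._entries.append((image_url, area, extracted))

    def lookup_area(self, image_url: str, selected: Area) -> Optional[str]:
        """Return cached extracted information whose area overlaps the selected area."""
        for url, area, extracted in self._entries:
            if url == image_url and areas_overlap(area, selected):
                return extracted
        return None
```

When `lookup_area` returns a value, the terminal can display it without transmitting a request; when it returns `None`, the terminal proceeds with the request as in step S708.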

Moreover, the program of this embodiment can be written in any of various script languages that are executed by a browser.

With this embodiment, it is possible to have the results of character recognition processing and conversion processing that are performed by a server be displayed on a terminal that is capable of communicating with the server, and it is possible to have characters of an image on a WEB page be displayed in real-time on the terminal.

The embodiments described above are for explanation only and do not limit the scope of the present invention. Moreover, the present invention is not limited to being applied to specific technical specifications such as illustrated in the embodiments described above. Therefore, it is possible for a person who is skilled in the art to apply embodiments in which each element or all of the elements described in this specification have been replaced with equivalent elements, and those embodiments are also included within the scope of the present invention.

The present invention is based on Japanese Patent Application No. 2011-101534 filed on Apr. 28, 2011. The entire specification, claims and drawings of Japanese Patent Application No. 2011-101534 are incorporated in this specification by reference.

INDUSTRIAL APPLICABILITY

With the present invention, it is possible to provide a server, a server control method, a program and a recording medium that are suitable for translating characters in an image on a WEB page in real-time.

REFERENCE SIGNS LIST

-   100 Server
-   101 Request receiver
-   102 Image processor
-   103 Response transmitter
-   104 Inquiry receiver
-   105 Answer transmitter
-   211, 212 to 21 n, 200 Terminal
-   201 Display
-   202 Position detector
-   203 Inquiry transmitter
-   204 Answer receiver
-   205 Area detector
-   206 Setter
-   207 Request transmitter
-   208 Response receiver
-   300 Internet
-   400 WEB server
-   500 Information processing device
-   501 CPU
-   502 ROM
-   503 RAM
-   504 NIC
-   505 Image processor
-   506 Audio processor
-   507 DVD-ROM drive
-   508 Interface
-   509 External memory
-   510 Controller
-   511 Monitor
-   512 Speaker
-   600 WEB page
-   601, 602, 603 Image
-   604, 605, 606 Text
-   701 Cursor
-   801, 803, 806, 807, 809, 810, 811, 812, 813, 814, 815, 816 Pop-up
-   802, 804, 805 Button
-   901, 931, 932, 933, 941 Rectangle
-   902, 903, 904, 905, 906, 907, 908, 909, 910 Area
-   911, 912 Corner
-   921, 922, 923 Recognized area

The invention claimed is:
 1. A server comprising: a request receiverthat receives a request in which image ID information and a parameterfor image processing are specified, from a terminal; an image processorthat acquires an image based on the ID information that was specified inthe received request, and by using the parameter that was specified inthe received request to apply the image processing on the acquiredimage, outputs extracted information that was extracted from inside theimage; and a response transmitter that transmits a response in which theextracted information that was outputted by the image processor isspecified, to the terminal; wherein after the extracted information hasbeen extracted, the image processor associates and caches the extractedinformation with the image ID information and the parameter; when theextracted information that is associated with the ID information and theparameter that were specified in the received request is cached, theimage processor, instead of acquiring the image and applying the imageprocessing, obtains the cached extracted information, and outputs theobtained extracted information; the image processing by the imageprocessor includes performing character recognition; and the extractedinformation comprises a recognized character string corresponding to thecharacter recognition.
 2. The server according to claim 1, wherein the parameter that is specified in the request includes a specified area; the image processing by the image processor includes extraction processing that, by performing the character recognition on the specified area inside the image that is included in the parameter, outputs the recognized character string that was recognized from the specified area in the image; after the recognized character string has been extracted, the image processor associates and caches the recognized character string with the image ID information and the specified area; when the recognized character string that is associated with the ID information and the specified area included in the parameter that are specified in the received request is cached, the image processor, instead of the character recognition, obtains the cached recognized character string, and outputs the obtained recognized character string; and the extracted information includes the recognized character string.
 3. The server according to claim 2, wherein when the ID information that was associated when the recognized character string was cached matches the ID information that was specified in the received request, and the specified area that was associated when the recognized character string was cached overlaps the specified area that is included in the parameter that was specified in the received request, the image processor determines that the extracted information that is associated with the ID information and the specified area included in the parameter that were specified in the received request is cached.
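As a non-limiting sketch, the determination recited in claim 3 (matching ID information together with overlapping specified areas) might be expressed as below. The names `Area`, `CacheEntry`, `overlaps` and `find_cached` are hypothetical. The sketch further assumes that areas are axis-aligned rectangles and that "overlaps" means an intersection of non-zero size; the claims do not fix either point.

```python
from typing import Iterable, NamedTuple, Optional


class Area(NamedTuple):
    """Axis-aligned rectangle in image coordinates (an assumption of this sketch)."""
    left: int
    top: int
    right: int
    bottom: int


class CacheEntry(NamedTuple):
    image_id: str
    area: Area
    recognized: str


def overlaps(a: Area, b: Area) -> bool:
    # True only when the rectangles share a region of non-zero size.
    return (a.left < b.right and b.left < a.right
            and a.top < b.bottom and b.top < a.bottom)


def find_cached(entries: Iterable[CacheEntry], image_id: str,
                requested: Area) -> Optional[str]:
    """Return a cached recognized string if the ID matches and the areas overlap."""
    for entry in entries:
        if entry.image_id == image_id and overlaps(entry.area, requested):
            return entry.recognized
    return None
```

Under these assumptions, two rectangles that merely touch along an edge are not treated as overlapping, so such a request would fall through to character recognition.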
 4. The server according to claim 2 or 3, wherein when the recognized character string is extracted, the image processor caches the recognized character string after correcting the specified area to a recognized area in the specified area where the recognized character string was recognized.
 5. The server according to claim 2, wherein the parameter that is specified in the request includes a language for recognition; and the character recognition by the image processor is performed using the language for recognition.
 6. The server according to claim 5, wherein the parameter that is specified in the request includes conversion information; the image processing by the image processor includes conversion processing that, by performing conversion processing on the outputted recognized character string using the conversion information that is included in the parameter that was specified in the received request, outputs a converted character string that is converted from the recognized character string; and the extracted information includes the converted character string.
 7. The server according to claim 6, wherein when the converted character string is extracted, the image processor associates and caches the converted character string with the image ID information, the specified area, the language for recognition, the recognized character string, and the conversion information; and when the converted character string is cached for the ID information, the specified area, the language for recognition and the conversion information that were specified in the received request, the image processor, instead of the conversion processing, obtains the cached converted character string, and outputs the obtained converted character string.
 8. The server according to claim 6, wherein the conversion processing obtains the converted character string by translating the recognized character string to a translation target language that is specified in the conversion information.
 9. The server according to claim 6, further comprising: an inquiry receiver that receives an inquiry in which the image ID information, a position inside the image and the conversion information are specified, from the terminal; and an answer transmitter that, when the converted character string that is associated with the ID information that is specified in the received inquiry, the conversion information, and the specified area that includes the position specified in the received inquiry is cached by the image processor, transmits an answer in which the cached converted character string is specified, to the terminal.
 10. The server according to claim 2, further comprising: an inquiry receiver that receives an inquiry in which the image ID information and a position inside the image are specified, from the terminal; and an answer transmitter that, when the recognized character string that is associated with the ID information that is specified in the received inquiry and a specified area that includes the position that is specified in the received inquiry is cached by the image processor, transmits an answer in which the cached recognized character string is specified, to the terminal.
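Purely as an illustrative sketch, the inquiry handling of claim 10 (looking up a cached recognized character string whose specified area includes the inquired position) could take the following form. The names `CachedString` and `answer_inquiry`, the `(left, top, right, bottom)` tuple layout, and the half-open treatment of rectangle edges are assumptions of this sketch rather than features stated in the claims.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass(frozen=True)
class CachedString:
    image_id: str
    area: Tuple[int, int, int, int]  # assumed layout: (left, top, right, bottom)
    recognized: str


def answer_inquiry(cache: Iterable[CachedString], image_id: str,
                   x: int, y: int) -> Optional[str]:
    """Return the cached string whose area contains (x, y), or None if nothing is cached."""
    for entry in cache:
        left, top, right, bottom = entry.area
        # Right and bottom edges are treated as exclusive (an assumption).
        if entry.image_id == image_id and left <= x < right and top <= y < bottom:
            return entry.recognized
    return None
```

A `None` result corresponds to the case in which no answer containing a cached string can be given, for example when the cursor is over a part of the image that has not yet been processed.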
 11. The server according to claim 1, further comprising: an inquiry receiver that receives an inquiry in which the image ID information is specified, from the terminal; and an answer transmitter that, when the extracted information that is associated with the ID information that is specified in the received inquiry is not cached, transmits an answer in which a message is specified that prompts transmission of the request in which the image ID information and the parameter for image processing are specified, to the terminal, and when the extracted information that is associated with the ID information that is specified in the received inquiry is cached, transmits an answer in which the extracted information is specified, to the terminal.
 12. A computer comprising: a display that displays an image identified by the ID information on a screen and that is capable of communicating with the server according to claim 10; at least one non-transitory memory operable to store program code; and at least one processor operable to read the program code and operate as instructed by the program code, the program causing the computer to function as: a position detector that detects a selection of a position inside the image that is displayed on the screen; an inquiry transmitter that, when the selection of the position is detected, transmits an inquiry in which the ID information and the selected position are specified, to a server; and an answer receiver that receives an answer from the server; the program causing the computer to function such that the display further displays on the screen the extracted information that is specified in the received answer; the program causing the computer to function as: an area detector that detects a selection of an area inside the image that is displayed on the screen; a setter that prompts setting the parameter for image processing; a request transmitter that, when the selection of the area is detected, transmits a request in which the ID information, the set parameter, and the selected area are specified, to the server; and a response receiver that receives a response from the server; and the program causing the computer to function such that the display further displays on the screen the extracted information that is specified in the received response.
 13. A control method for a server, comprising: a request receiving step for receiving a request in which image ID information and a parameter for image processing are specified, from a terminal; an image processing step for acquiring an image based on the ID information that was specified in the received request, and by using the parameter that was specified in the received request to apply the image processing on the acquired image, outputting extracted information that was extracted from inside the image; a response transmission step for transmitting a response in which the extracted information that was outputted is specified, to the terminal; wherein the image processing step includes after the extracted information has been extracted, associating and caching the extracted information with the image ID information and the parameter; when the extracted information that is associated with the ID information and the parameter that were specified in the received request is cached, instead of acquiring the image and applying the image processing, obtaining the cached extracted information, and outputting the obtained extracted information; the image processing step includes character recognition; and the extracted information comprises a recognized character string corresponding to the character recognition.
 14. A non-transitory recording medium that is readable by a computer and that records a program that causes the computer to function as: a request receiver that receives a request in which image ID information and a parameter for image processing are specified, from a terminal; an image processor that acquires an image based on the ID information that was specified in the received request, and by using the parameter that was specified in the received request to apply the image processing on the acquired image, outputs extracted information that was extracted from inside the image; and a response transmitter that transmits a response in which the extracted information that was outputted by the image processor is specified, to the terminal; wherein after the extracted information has been extracted, the image processor associates and caches the extracted information with the image ID information and the parameter; when the extracted information that is associated with the ID information and the parameter that were specified in the received request is cached, the image processor, instead of acquiring the image and applying the image processing, obtains the cached extracted information, and outputs the obtained extracted information; the image processing by the image processor includes performing character recognition; and the extracted information comprises a recognized character string corresponding to the character recognition.